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Abstract — A construction using the E8 lattice and Reed-Solomon 
codes for error-correction in flash memory is given. Since E8 lattice 
decoding errors are bursty, a Reed-Solomon code over GF(2 8 ) 
is well suited. This is a type of coded modulation, where the 
Euclidean distance of the lattice, which is an eight-dimensional 
signal constellation, is combined with the Hamming distance of the 
code. This system is compared with the conventional technique for 
flash memories, BCH codes using Gray-coded PAM. The described 
construction has a performance advantage of 1.6 to 1.8 dB at a 
probability of word error of 10 6 . Evaluation is at high data rates 
of 2.9 bits/cell for flash memory cells that have an uncoded data 
density of 3 bits/cell. 

I. Introduction 

While single-level flash memory stores a single bit per memory 
cell, data density can be increased by using two, three or more bits 
per cell [1] [2]. Along with this increase in density, the influence 
of noise also increases. Numerous error-correcting approaches 
have been considered, although BCH codes are predominant in 
practice. 

A conventional multilevel-flash chip uses Gray-coded PAM and 
presents hard decisions externally. Because the signal-to-noise 
ratio in flash can be characterized as high, Gray-coded PAM 
results in single bit errors. Further, errors in flash memories tend to 
be uncorrelated. Thus, the errors may be characterized as random, 
isolated errors of one bit, and flash memory systems (such as SSD) 
use BCH error correcting codes of high rate. The decoder in such 
chips is a hard-input BCH decoder, implemented in an external 
chip. 

In carrier-based AWGN systems, two-dimensional constella- 
tions such as QPSK and QAM are used. Trellis-coded modulation 
is a low-complexity method to improve the performance by 
combining the Euclidean distance of the constellation with the 
Hamming distance of a convolutional code, or a more powerful 
error-correcting code [3]. In fact, trellis-coded modulation with 
convolutional codes and a QAM constellation (PAM over two 
dimensions) has been considered for general memories [5], and 
have been evaluated for flash memories [4]. In carrier-based 
systems, two-dimensional constellations aid with synchronization 
at the receiver, but this is not needed with flash memories. 

This paper considers the use of lattices — that is, higher dimen- 
sional constellations — for error-correction in flash memories. In 
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particular, the E8 lattice, which is the best-known lattice in eight 
dimensions, is considered for flash memory. While E8 has greater 
minimum Euclidean distance than PAM, an outer error-correcting 
code is still needed to guarantee data reliability. Whereas trellis- 
coded modulation uses a convolutional code, in this paper Reed- 
Solomon codes, which are a type of block code, are used. 

Because E8 decoding induces burst-like errors, Reed-Solomon 
(RS) codes constructed over GF(2 8 ) are used for error correction. 
One RS code symbol is encoded to eight memory cells; a group 
of eight memory cells represents one lattice point. When an E8 
decoding error occurs, with high probability it will be one of the 
240 neighboring lattice points. Thus, the constellation (lattice) 
to codeword mapping needs only to distinguish between these 
neighbors. Only the modulo-2 value of the lattice points are 
protected by the RS codes; the Euclidean separation of the lattice 
points is also important. 

The E8 lattice has a number of desirable properties. It is the 
the best-known lattice in eight dimensions, in the sense of having 
the densest packing, highest kissing number and being the best 
quantizer [12, p. 12]. It also has an efficient decoding algorithm. 
The lattice generator matrix is triangular, which makes it suitable 
for encoding. In addition, the E8 lattice points are either integers 
or half-integers; for implementations, this may be more suitable 
than writing arbitrary values to memory. 

Previously, the E8 lattice has been considered for trellis-coded 
modulation with a convolutional code and average power shaping 
[6]. In addition, there are numerous proposals for including RS 
codes in trellis-coded modulation [7] [8]. However, the combi- 
nation of the E8 lattice and RS codes described in this paper 
appears to be unique. For error-correction in flash memories, 
BCH codes have received the most attention; however recently 
RS codes, which can be constructed over a smaller field for the 
same block length, have also been considered for flash memory 
[9]. At high rates, the rate loss of RS codes compared to BCH 
codes is relatively small, and the smaller field size makes RS 
decoding more efficient. 

II. Background 

This section gives some background. First the assumed channel 
model is described, followed by a brief overview of lattices 
in general. Then, the E8 lattice and some of its properties are 
described. 



A. Channel Model 

The assumed model has N flash memory cells, which can store 
an arbitrary value between and V. The uncoded information rate 
is log 2 q bits per cell. In conventional multilevel flash memory, 
this is accomplished by choosing V = q — 1 and storing one of 
{0,1,..., q — 1} in each cell. For lattices of dimension n, the 
lattice encodes q n levels, or nlog 2 q bits in n cells. 

The flash reading and writing process introduces noise, which 
is modeled as additive white Gaussian noise with mean zero 
and variance a 2 . A more advanced model, which assigns higher 
variances to the levels and q — 1 is difficult to apply to lattice 
coding, since the cells store continuous values (see for example 
[4]). The channel SNR used in this paper is: 



YL 

a 2 : 



(1) 



where V 2 represents the peak signal energy. 

B. Lattices and Lattice Codes 

An n-dimensional lattice A is an infinite set of points x = 
(x\,X2i ■ ■ ■ > x nY defined by an n-by-n generator matrix G, for 
which 



Gb, 



(2) 



where b = (bi, . . . , &„)* is from the set of all possible integer 
vectors, b{ € Z. The i, j entry of G is denoted ay. The set A 
forms a group under addition in R™, so lattices are linear in the 
sense that the sum of any two lattice points is a lattice point. 

The minimum of the squared Euclidean distance between any 
two distinct lattice points is the minimum norm. The lattice points 
at this distance from the origin are the minimum vectors. The 
number of points at this distance is called the kissing number, 
denoted r. Because of the linearity of lattices, all lattice points 
have the same number of nearest neighbors, corresponding to the 
minimum vectors. The packing radius p is taken to be half the 
square root of the minimum norm. In terms of finite-field codes, 
the minimum vectors are analogous to the minimum-weight non- 
zero codes words, and the packing radius is analogous to half the 
minimum distance. When the lattice is scaled by a, its generator 
matrix is aG and the packing radius is ap. 

A practical coding scheme must select a finite subset of the 
lattice; a codebook may be constructed by the intersection of 
a shaping region B and the lattice A. In general, codebook 
construction is difficult, but when B is an n-cube and G is a 
triangular matrix, there exists a practical method [10] [11]. In 
particular, let B be an n-cube, given by: 

< xt < M, (3) 

and let a = (ax, ... , a n ) be information-containing integers, with 

(4) 



{0,1,..., — - 1} 

9a 



for i = 1, . . . , n. The diagonal elements must satisfy the condition 
that M/ga is an integer. It is convenient to assume that M/ga is 
a power of two, to aid encoding from bits to integers. 



In general the lattice point G ■ 
encoding finds a vector k = ( — 
b: 



a is not in B. Instead, the 
. . . , -z*-), which determines 



Mk, 



such that 



Gb 



(5) 



(6) 



is in the cube B. Because the generator matrix is lower triangular, 
the ki can be found by solving the inequality (3): 



(7) 



for ki, which is unique. From the triangular structure of G, first 
k\, then k-2, ■ ■ ■ ,k n are found in sequence. In particular: 



9ii ai 



(8) 



where the computation at step i depends upon b\, . . . , 6j_x- 

Since the power constraint for flash memory is cubic, one might 
expect that this encoding scheme is sufficient, by choosing M = 
V. Unfortunately, the strict upper inequality in (3) is necessary for 
ki to be unique. As a result, it is not possible to find lattice points 
for which x t = AI, which reduces the size of the codebook. But 
this problem is resolved for the E8 lattice in the next section. 

Decoding is straightforward. If x is a lattice point, and b = 
G _1 x, then a is found as: 

a,; = bj mod — . (9) 



9i 



C. E8 Lattice 



The E8 lattice is an eight-dimensional lattice, which can be 
described in terms of the D8 checkerboard lattice [12, p. 120]. 
The D8 lattice points are all integers that have an even sum. The 
E8 lattice is the union of the D8 lattice, and a coset of the D8 
lattice: 



where, 



E8 = D8 U D8 + | 



\ = (v 2 , x h, v 2) v 2 , v 2 , v 2 , y 2 , v 2 )- 



(10) 



(11) 



A generator matrix is: 



G 



Va 
Va 
Va 
Va 
Va 
Va 
Va 
Va 



.(12) 



The kissing number r of the E8 lattice is 240. The minimal 
vectors are sequences (±1 2 , 6 ) (there are 4- Q such sequences) 

and (±V 2 ,±V 2 ,±V 2 ,±y 2 ,±V 2 ,±V 2 ,±V 2 ,±V 2 ), where there me 
an even number of minus signs (there are 2 7 such sequences). 

The packing radius p is 1/ \f2. 



The E8 lattice has an efficient decoding algorithm, which finds 
the lattice point x closest to an arbitrary point y G R 8 [13]. The 
decoding algorithm is low-complexity: it requires only rounding 
operations, wrong-way rounding of the least reliable position, and 
comparison of Euclidean distance. 

III. Proposed Construction 

This section describes the proposed construction. First, the 
codebook is created by scaling the lattice by a < 1. Then, the 
encoding is given, which partitions the flash memory into blocks 
of 8 cells, each encoded using an E8 lattice. Each block corre- 
sponds to one RS code symbol. Finally, the decoding algorithm is 
described, including a post-processing step which uses the lattice 
error patterns to recover the integer sequence. 

A. Lattice Codebook 

The lattice code is the set of lattice points which satisfy the 
constraint that the cell value is between and V (inclusive), and 
cell values correspond to lattice points. A significant point is that 
the encoding described previously does not allow encoding Xi = 
M, but the physical system allows xi = V. The lattice should be 
scaled so that the lattice code (that is, B n A) contains q 8 lattice 
points. For the E8 lattice, it is observed that choosing 

V 

(13) 



M 



V + 0.5 
= V+l 



and 



(14) 



will satisfy this rate criteria and the power constraint. To show this, 
consider an example with q = 4 and V — 3. Initially encode the 
unsealed E8 lattice using not AI = 3, but M = 4; this violates the 
power constraint. Because the E8 lattice consists of integer points 
and half-integer points, this encoding results in lattice points with 
a maximum value of 3.5. By scaling the lattice by 3/3.5 the half- 
integer lattice points are placed exactly on the boundary V. 

Using the unsealed lattice, the total number of lattice codewords 
that satisfy < Xj < V + 1 is for the E8 lattice is: 

(V + l)" 



(V + l)' 



(15) 



dot G\ 

The scaling does not change the number of lattice codewords. For 
convenience, encoding is done with the unsealed lattice, (12), and 
scaling is applied before writing to memory. 

The scaling reduces the separation between lattice points. For 
the E8 lattice and large q, p approaches the maximum value of 
\/\/2. For the small value of q = 4, a = 6/7 results in an 
effective packing radius of 0.606; this is higher 0.5, the packing 
radius of the PAM constellation. 

B. Encoding 

Consider A" flash memory cells. These N cells are separated 
into blocks, each block consists of 8 cells, so there are A/8 blocks. 
Let x denote the vector of blocks, that is: 



(xi,x 2 , . . .jXjv/s), 



(16) 



where each x» is a lattice point. 

A (n c ,k c ,t) shortened, systematic RS code constructed over 
GF(2 8 ) is used. Each RS code symbol is assigned to a block, 



so n c = N/8. Blocks to which systematic RS symbols and 
parity symbols are assigned are called systematic blocks and parity 
blocks, respectively. The block Xj is systematic for i = 1, . . . , k c 
and is parity for i = k c ,...,n c . The information integers are 
represented as: 



(ai,a 2 , 



(17) 



written to separate the integers in systemic blocks and integers in 
parity blocks. 

Encoding for systematic blocks is identical to the general lattice 
encoding described previously. For an example of q = 8, within 
any systematic block, if a is the vector of information integers, 
then ai G {0, 1, . . . , 15}, as G {0, 1, 2, 3} and the remaining six 
integers are from {0, 1, . . . , 7}. 

Encoding for parity blocks requires combining the computed 
RS parity checks and additional information integers. For each 
systematic block, compute: 



u ( = a, mod 2, 



(18) 



for i = 1, ...,k c , where u = a mod 2 means component- 
wise modulo-2. The eight bits (ui, . . . , u$) form a single GF(2 8 ) 
RS symbol. Using information ui, ...,Ufe c , compute the RS 
parity symbols pi, . . . , p nc -fec- For each parity block, p = 
(Pi,P2,---,Ps),Pi G {0,1}. 

To perform lattice encoding in the parity blocks, integers are 
formed where the least-significant bit is the RS parity, and the 
remaining part is information. The information integers in the 
parity block are ai G {0, 1, . . . , jjp}- For the example of q = 8: 
ai G {0,1,..., 7}, ag G {0,1} and the remaining six integers 
are from {0, 1, 2, 3}. The integers to be lattice encoded in parity 
block i are: 



Pi-k c 



(19) 



for i = k c + 1, . . . , n c , and addition is over the real numbers. 
Thus, additional information is embedded in the parity blocks, 
because only the LSB of the parity block is needed by the RS 
parity symbol. Note also that Gray coding is not used. 
The total number of encoded information bits is k, 

k = fc c -81og 2 g+(n c -fc c )(-8 + 81og 2 g) (20) 

and the total number of cells used is N — 8n c , so the code rate 
R = k/N, measured in bits per cell is: 



R 



— log 2 q 



1 + log 2 q) bits/cell. (21) 



C. Decoding 

The encoded lattice point is scaled by a = V/ (V+0.5), passed 
through an AWGN channel, and scaled by 1/a by the decoder. 
Let this received sequence be denoted by (yi, y 2 , . . . , y„ c ). 
Block-by-block E8 lattice decoding is performed [13], and the 
integer sequence (ai , . . . , a„ c ) is obtained. Compute u = a 
mod 2 and perform RS decoding. Consider any single block. 
If lattice decoding is successful, then the RS symbol will be 
correct. However, if lattice decoding is unsuccessful, with high 
probability, a minimal vector error is made, that is, the lattice 
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TABLE I 

Several of the 240 minimum norm error patterns for the E8 lattice. The vector u © u is sufficient to identify the error pattern, except 

FOR A SIGN CHANGE. 



decoder erroneously selects one of the 240 neighboring lattice 
points. 

If the number of block errors (that is, lattice decoding errors) is 
less than the error-correcting capability t of the RS code, then the 
blocks which have errors can be identified. But since the RS code 
only protects the least- significant bits, RS decoding alone cannot 
recover all the information. The distance properties of the lattice 
will be used. This may be regarded as a type of post-processing 
step. 

The difference a— a denotes the integer error pattern, and uffiu 
= |u — u| denotes the bit error pattern. Since u is provided by the 
RS decoder, u © u is known. A sample of some of the 240 error 
patterns is given in Table I. Except for a sign change throughout, 
the bit error pattern u © u uniquely identifies the integer error 
pattern. By employing a look-up table (much like Table I) this bit 
error pattern u © u can be mapped to an integer error pattern e, 
used to find a two new estimated integer sequences Si and &2'. 



ai = a + e, and 
a 2 = a e. 



(22) 
(23) 



Select the vector a that has the shortest distance from the received 
sequence y: 

a, ||a, - y|| < ||a 2 - y|| 
a2 otherwise 



a = 



(24) 



Thus, the entire integer sequence can generally be recovered, if 
the correct RS symbol is known. 

IV. Numerical Results 

The proposed construction is evaluated numerically. Com- 
parisons are made with the dominant conventional system for 
error-correction in flash memories, BCH codes using Gray-coded 
PAM. To show the benefit of the proposed construction, a fair 
comparison is made by selecting code parameters to encode about 
4096 bits, so that the number of cells N (and thus the code rate) 
are as similar as possible. 

For parameters m, t and s, there exists a systematic, shorted 
RS code of length n c = 2 m — 1 — s symbols, encoding k c = 2 m — 
1 — It — s information symbols, that can correct t symbol errors. 
For parameters m, t and s, there exists a systematic, shortened 
BCH code of length 2 m — 1 — s bits, encoding 2 m — 1 — mt — s 
information bits, that can correct t symbol errors. 

Fig. 1 shows the simulated word error probability for q = 8 
(3 bits per flash cell), and various code rates. The specific code 
parameters are in Table II. At a probability of word error rate 
of 10~ 6 , the uncoded E8 lattice has approximately 1.8 dB better 
performance than PAM. This benefit is preserved after coding, 
with gains of 1.6 to 1.8 dB are observed. Note that for each code 
comparison the error-correction capability of the RS and BCH 
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TABLE n 

RS AND BCH CODES CONSIDERED IN FIG. 1 
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Fig. 1 . Probability of word error for Reed-Solomon codes with E8 lattice, com- 
pared to BCH codes with Gray-coded PAM, for flash memory with q = 8 (uncoded 
3 bits/cell). The proposed construction has 1.6 to 1.8 dB better performance. The 
uncoded performance is also shown, where an error corresponds to a symbol (an 
E8 lattice symbol or a q-ary PAM symbol) being in error. 



code is essentially the same. 

It should be noted that the E8 lattice decoder is a soft-input 
decoder, whereas PAM decoding is hard decision. In both cases, 
the RS and BCH decoders are using hard inputs. In commercial 
flash memory products, the memory and error-correction functions 
in separate chips. While the E8 decoder has soft inputs, because 
it is a low-complexity decoder, future flash memory systems 
could implement a lattice decoder on the flash memory chip. 
Thus, the error-correction performance may be improved, without 
significant changes to existing system architecture. 
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